Evaluation of machine translation by reading comprehension tests and subjective judgments
نویسنده
چکیده
This paper discusses the results of an experiment designed to test the quality of translations, in which human subjects were presented with IBM-produced machine translations of several passages taken from the Russian electrical engineering journal Elektrosviaz, and with human translations of some other passages taken from Telecommunications, the English translation of Elektrosviaz. The subjects were tested for comprehension of the passages read, and were also asked to judge the clarity of individual sentences. Although the human translations generally gave better results than the machine translations, the differences were frequently not significant. Most subjects regarded the machine translations as comprehensible and clear enough to indicate whether a more polished human translation was desirable. The reading comprehension test and the judgment of clarity test were found to give more consistent results than an earlier procedure for evaluating translations, since the questions asked in the current series of tests were more precise and limited in scope than those in the earlier scries.
منابع مشابه
The Relationship between Translation Tests and Reading Comprehension: A Case of Iranian University Students
The present study seeks to investigate the potentiality of the translation task as a testing method for measuring reading comprehension. To achieve this objective, two types of translation tests, open-ended and multiple-choice tests, and two types of reading comprehension tests, multiple-choice reading comprehension and open-ended cloze tests were developed in this study. The reliability of the...
متن کاملFast, Cheap, and Creative: Evaluating Translation Quality Using Amazon's Mechanical Turk
Manual evaluation of translation quality is generally thought to be excessively time consuming and expensive. We explore a fast and inexpensive way of doing it using Amazon’s Mechanical Turk to pay small sums to a large number of non-expert annotators. For $10 we redundantly recreate judgments from a WMT08 translation task. We find that when combined non-expert judgments have a high-level of ag...
متن کاملVocabulary Instruction Method and Specialized Reading Comprehension: Build a Bridge or Wash it away
The present study aimed to examine and compare the impact of teaching economic terms through etymological elaboration with three more conventional methods of vocabulary instruction in ESP courses in Iran, that is, teaching through contextual definitions, L1 translation, and implicit instruction on the learners' general comprehension of economic texts and their understanding of author's opinion....
متن کاملA Reading Comprehension Corpus for Machine Translation Evaluation
Effectively assessing Natural Language Processing output tasks is a challenge for research in the area. In the case of Machine Translation (MT), automatic metrics are usually preferred over human evaluation, given time and budget constraints. However, traditional automatic metrics (such as BLEU) are not reliable for absolute quality assessment of documents, often producing similar scores for do...
متن کاملEvaluation of Reading Support Tools by Reading Comprehension Tests and Reading Speed Tests
This paper introduces our reading process monitoring systems and also presents the experimental results that show the adequacy of our reading data. Our system divides a text into reading areas and records reading time for each area. We conducted two experiments using this tool to verify the adequacy of our reading process data. In the first experiment, we examined whether the reading process ca...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Mech. Translat. & Comp. Linguistics
دوره 8 شماره
صفحات -
تاریخ انتشار 1965